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Abstract 



I consider variations in the definitions, at next-to-leading order (NLO) and at next- 
to-next-to leading order (NNLO), of a General-Mass Variable Flavour Number Scheme 
(GM-VFNS) for heavy flavour structure functions. I also define a new "optimal" scheme 
choice improving the smoothness of the transition from one flavour number to the next. 
I investigate the variation of the structure function for a fixed set of parton distribution 
functions (PDFs) and also the change in the PDFs when a new MSTW2008-type global fit 
^sO | to data is performed for each GM-VFNS. At NLO the parton distributions, and predictions 

using them at hadron colliders, can vary by ~ 2 — 3% from the mean value. At NNLO 
there is far more stability with varying GM-VFNS definition, and changes in PDFs and 
predictions are less than 1%, with most variation at very small x values. Hence, mass- 
scheme variation is an additional and significant source of uncertainty when considering 
parton distributions, but as with all perturbative uncertainties, it diminishes quickly as 
higher orders are included. 



The treatment of heavy flavours in structure functions has a significant effect on the parton 
distribution functions (PDFs) obtained in fits to structure function, and other data, and conse- 
quently on the predictions for cross sections at hadron colliders such as the LHC and Tevatron. 
The up, down and strange quark are regarded as "light" quarks, and are always treated in the 
massless approximation since their current mass is much smaller than the regimes in which we 
use perturbative QCD. However, both the charm quark mass m c and bottom quark mass m;, 
are much larger than Aqcd and so we can treat the generation of the charm and bottom parton 
distributions as a perturbative process. The details of the description of these heavy flavours, 
charm and bottom, in structure functions has an important impact on the PDFs extracted in 
fits, due both to the existence of direct data on F£(x, Q 2 ), where h represents the heavy quarks 
charm (c) and bottom (b), and also due to the contribution to the heavy quarks to the total 
structure function, particularly at small x. Indeed, using the published data [1J from the HERA 
experiment the latter of these is probably more important. 



Traditionally there are two distinct regimes for heavy quark production using quite distinct 
theoretical descriptions. For Q 2 ~ m\ massive quarks are thought of as being created in the 
final state, and described using the Fixed Flavour Number Scheme (FFNS) (see [2] for NLO 
results), 

F(x,Q 2 ) = CF*{C?/ml) ® /^(g 2 ), (1) 

where rif is the number of light quark flavours. This does not sum agin™ Q 2 /m\ terms in the 
perturbative expansion, the consequence of which may well limit accuracy, though this is still 
a matter of debate. At high scales, Q 2 ^> m\, heavy quarks behave like massless partons. The 
large logarithms mentioned can then be automatically summed via the solution of evolution 
equations for the heavy quark parton distributions. The distributions for different light quark 
number are related to each other via the perturbative expression 

f? s+ \&) = MAl™l)®f n k t ii4l (2) 

where the matrix elements Aj k (fi 2 F /m 2 H ), calculated at 0(a 2 s ) in [3] (and where calculations are 
ongoing at C?(a|) [HOE]), contain the fixed-order \n(^ 2 F /m 2 ) contributions. These expressions 
can be used at an appropriate choice of the factorisation scale, usually fip = vr? h to obtain the 
boundary conditions for the evolution of the heavy flavour distribution, and simultaneously the 
change in the light parton distributions at this "transition point". In the Q 2 /m 2 L — > oo limit the 
description becomes the Zero-Mass Variable Flavour Number Scheme (ZM-VFNS), 

F(x, Q 2 ) = Cf UVF ^ +m ® f- f+m (Q 2 ), (3) 

where m is the number of heavy quarks which have effectively become light quarks. Although 
called a "scheme" this is really an approximation until one reaches the real asymptotic limit 
since it ignores all 0{m 2 h /Q 2 ) corrections. 

Various studies of comparisons between the FFNS and ZM-VFNS have been made. For the 
currently most obvious case of neutral current F£(x, Q 2 ) it seems that the difference at NLO at 
high Q 2 can be significant, but not enormous, see e.g. (HO [9]. However, there are cases where 
the difference is more obvious, e.g. F£(x, Q 2 ) in charged-current DIS [HI [11], and for example 
there is a distinct difference between the results using the two methods of calculation for the 
6-quark associated contribution to the Z cross-section in hadron-hadron colliders [12J and for 
the neutral supersymmetric Higgs boson [13] , particularly for the largest boson masses. It seems 
natural to assume that the ZM-VFNS is more precise at very large scales, where resummation 
of large logarithms is most urgent, and that the FFNS is more precise for scales near m^, where 
the ZM-VFNS is only an approximation. In order to correct this shortcoming in the ZM-VFNS 
but also sum the logarithms in Q 2 /mf L (via explicit heavy quark parton distributions), and hence 
to obtain the most effective description between the two limits of Q 2 < m 2 H and Q 2 3> m 2 H , one 
must use a General-Mass Variable Flavour Number Scheme (GM-VFNS). 

A GM-VFNS is defined similarly to the ZM-VFNS, i.e. the structure function is written as 

F(x, Q 2 ) = Cf MYF ^ +m (Q 2 /m 2 h ) ® /; /+m (Q 2 ), (4) 
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where now the coefficient functions are dependent on Q 2 / m L but reduce to the zero mass limit 
as Q 2 /m\ — > oo. If we consider the transition from nj active quarks to rif + 1 then we can write 

F(x,Q 2 ) = Cf MYY ^ + \Q 2 /ml)®f r ;i + \Q 2 ) 

= Cf M ^ + \Q 2 /ml) ® A jk {Q 2 /m 2 h ) ® f k \Q 2 ) = C?* \Q 2 /m\) ® f^(Q 2 ), (5) 

and the GM-VFNS can be defined from the formal equivalence of the rif flavour and rif + 1 
flavour descriptions at all orders, resulting in 

C F k F ' ni (Q 2 /ml) = Cf M ^ + \Q 2 /ml) ® A jk (Q 2 /m 2 h ), (6) 

where for simplicity I have set /Mp = Q 2 . The fact that Eq.(T5]) correctly converts rif flavour 
PDFs to rif + 1 flavour PDFs guarantees that at Q 2 /m\ — )> oo, where all power-suppressed 
m\jQ 2 corrections become negligible, the GM-VFNS coefficient functions become identical to 
the ZM-VFNS coefficient functions. 

In order to illustrate the manner in which the GM-VFNS works we see that at 0(as) Eq.([6]) 
results in the equivalence 

cZ^iQVml) = c^ F ' nf+w \Q 2 /<) ® pSMQVO + c^ F ' ni+m (Q 2 /^ (7) 

(where (n) represents the nth order in as term of the coefficient function), which defines the 
GM-VFNS coefficient functions. As stated, the coefficient functions must tend to the massless 
limits as Q 2 jm\ — > oo, and Eq.([7|) is consistent with this. However, Cf M ^ ¥ (Q 2 /m 2 /) is only 

uniquely defined in this limit. One can swap 0{m\/Q 2 ) terms between Cf^ Fm {Q 2 /m\) and 
^GMVF(i)^ 2 ^ m 2-j j n an( j ma k e similar swaps at higher order, i.e. as pointed out in [5] 

Eq. §6§ is just an equivalence which can be used in the definition of a scheme, but has no unique 
solution since there are more free variables on the right-hand side of the equation than there are 
equations. (A similar observation was made independently in a slightly different context in [14J , 
and presented very clearly and used to define a particularly efficient scheme in [15].) To show 
how the independence under changes of order 0(m 2 h / Q 2 ) works let us consider the expression 
for F£ h up to 0(a s ). 

*t = C^ +1 ' { °\Q 2 /m 2 h ) ®(h + h) + a s C^ l+1 ' il \Q 2 lrnl ®(h + h) 

+ «S (<f' (1) (Q 2 / m D - ^ST nf+1 * (0) (QVm 2 ) ® Pi H» 2 F /m 2 )) ® g, (8) 

and let us assume that this is a particular well-defined GM-VFNS. Now we consider making a 
redefinition 

C™ VP '°' +1 ' (0) W7miD - C™r* ,+1 - { °'(Q 2 /™l) + K$r«+»»«?/n$>, (9) 

where 5C^^ rF ' ni+1 '^°\Q 2 /m\) vanishes like a power of m — h 2 /Q 2 as Q 2 /m\ — > oo. Under such 
a change in the coefficient function the change in the structure function is 

SFf = 5C^ F ^°\Q 2 /m 2 h mh + h)-a s 5 (10) 



From the relationship between the rif + 1 and n/ PDFs in Eq.(j2]) one can easily show that up 
to 0(a s ), 5F£ h = (i.e. (h + h) = a s Pq g ln(/4/m|) <g> g + 0(a 2 s )). However, it is also useful 
to consider the renormalisation group equation in terms of factorisation scale Clearly a 
correctly defined structure function will have no dependence on /xp. Explicitly 

= <r' nf+i>(0 ^ 

- a s 5C^ F ^°\Q 2 /ml) ® F» ® 9 + 0(4), (H) 



and using the fact that (h + h)) starts at 0(as) we see that 

It can be shown explicitly that this feature exists to higher orders, but it is guaranteed from 
the definition Eq.flf)]) to work at all orders. At a finite order there is not exact cancellation, but 
the dependence on the choice of e.g. 5C^^ F ' nf+1 '^ m \Q 2 /ml), will be higher order than that to 
which the structure functions are defined. For example, at NLO, where the 0{ots) coefficient 
functions are applied in the rif + 1 flavour regime, there is no cancellation in Eq.t flOi) between the 
NLO evolution of the charm distribution oc a s Pq g g) g in the direct heavy flavour contribution 
and the coefficient function combined with gluon distribution contribution. However, this occurs 
when the 0(a 2 s ) coefficient functions are introduced at NNLO, and the uncancelled contribution 
is pushed to the evolution term cx ot s P 2 g ® g. As we will see, this results in far more stable 
results under variation of GM-VFNS at NNLO than at NLO. 

Perhaps the nomenclature "scheme" is misleading, since it suggests that one compensates 
for a change in coefficient functions with a change in PDFs, e.g. as when transforming from 
the MS factorisation scheme to the DIS factorisation scheme. Here it is instead the case that 
because the heavy flavour is generated entirely from the gluon and light quarks via evolution 
(ignoring intrinsic flavour, which is discussed in [IB]) there is an inherent ambiguity in exactly 
which contribution can be attributed to the heavy flavour and which to the light PDFs since 
one can be expressed precisely in terms of the other. It is only the fact that the large \n(Q 2 /m 2 /) 
terms are resummed in the heavy flavour distribution by evolution that stops the invariance 
under change of GM-VFNS being exact at finite order. Expressing (h + h) at the same order in 
as as the coefficient functions, rather than as the full solution of the evolution equations would 
lead to exact invariance, but would be pointless as it would simply reproduce the results of the 
FFNS. 

It should be noted that the freedom to modify coefficient functions by power suppressed 
terms, so long as this modification is applied to the corresponding subtraction terms, occurs 
separately in each structure function or cross-section. One may even change the GM-VFNS for 
F 2 (x, Q 2 ) while leaving the one for Fl(x, Q 2 ) the same. The choice made in the fit to structure 
functions does not influence the choice made in predictions at hadron colliders, though at the 
moment most such predictions are at high enough scales that the zero-mass coefficients may 
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be used as a good approximation. This is certainly the case for data such as high-Er jets and 
W, Z production at the Tevatron and LHC currently used in global fits. The heavy flavour 
contribution to the cross section for fixed target Drell Yan data is extremely small so the total is 
insensitive to the heavy flavour prescription used. In principle some collider data would be best 
described using a GM-VFNS. There is no restriction on the form chosen from the manner in 
which the PDFs are fit (unless exactly the same data type is used in the fit, in which case there 
should be consistency). One would simply expect a sensible definition to be used for all data 
types. One clear criterion might be that all coefficient functions respect the correct kinematics 
to produce the physical final state containing the heavy quarks. 

The freedom to define different definitions of a GM-VFNS has resulted in the existence of 
various prescriptions [HJ [9j [T7J [18j [19] each with a particular reasoning for the choice. The original 
ACOT scheme [8] simply calculated the heavy quark coefficient functions for single massive 
quark production and defined these to be C 2 ^ F ' Ilf+1 ' , hence defining the scheme. However, this 
required explicit calculation and also did not impose the correct kinematical requirement that 
(in neutral current DIS) one must have enough energy to create a pair of massive quarks in 
the final state. The TR GM-VFNS [9] highlighted the freedom in choice, and enforced correct 
kinematics via a definition which demanded the continuity of the value of dF^/d In Q 2 across 
the transition point. However, this was only possible in the gluon sector beyond 0(cts) and was 
quite complicated. The (S)ACOT(x) prescription [18] applied the simple choice 

C 2jA F (Q 2 / m l, z)(x5{z- aw), (13) 

which gives 

it (0) (x,Q 2 ) oce 2 h (h + h)(x/x max ,Q 2 ), (14) 

where x max = Q 2 / (Q 2 + 4m 2 ), and imposes the threshold W 2 = Q 2 {1 — x)/x > 4m|. This gives 
the usual limit 

6(1 -z) (15) 

for Q 2 /mf l — > oo. The modified TR, or TR' scheme [19j adopted this and extensions to higher 
orders (though uses a different multiplicative factor of Q 2 / (Q 2 + 4m|) for the zeroth-order coef- 
ficient function [20J). Very recently the (S)ACOT(x) prescription has been extended explicitly 
to NNLO |2T]. 

However, there is a further, more significant difference between the TR' scheme and the 
SACOT(x) scheme. ACOT-type schemes have used the same order of as above and below the 
transition point Q 2 = ml, despite the fact that FFNS is LO at 0(o>s) while the ZM-VFNS starts 
at zeroth order. Instead the TR' definition uses, for example, at LO the 0(as) FFNS result for 
Q 2 < ml, and for Q 2 > ml 

F>\x,Q 2 ) = a s (m 2 )CZr {1 \Q 2 = <)®^\Q 2 = m 2 h ) + C^ F ^ \Q 2 /m 2 J^(h + h)(Q 2 ), 

(16) 

i.e. it freezes the higher order as term when going upwards through Q 2 = m\. This difference 
in choice can be phenomeno logically important in the vicinity of Q 2 = ml, though becomes less 
important at high Q 2 as the frozen term becomes smaller compared to the growing contribution 
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from the heavy quark distribution. The effect is quite significant at small x at NLO, since the 
0(a s ) contribution a s C F ^ u ^ <8> g nf is large for Q 2 ~ m 2 , as seen in Fig. 2 of [19] . 

There is also an alternative, but ultimately equivalent, way of formulating a GM-VFNS, 
where instead of having a transition point where one turns on heavy quark distributions and 
defining GM-VFNS coefficient functions one obtains the total structure function from a variety 
of contributions which cancel each other appropriately to obtain the correct limits in the low 
and high Q 2 regimes. The BMSN (Buza, Matiounine, Smith and van Neerven) [3] and FONLL 
(fixed-order next-to-leading log) [20] definitions for a scheme are based on the expression 

F GMVF (x, Q 2 ) = Fi F (x, Q 2 ) - Fr ymp (x, Q 2 ) + F 2 ZMVF (x, Q 2 ), (17) 

where generally speaking the second (subtraction) term is the asymptotic version of the first, 
i.e. all terms of 0(m\/Q 2 ) are omitted. There are differences in exactly how the second and 
third terms are defined in detail in different schemes. In the standard applications the as order 
of -F 2 FF (x, Q 2 ) at low Q 2 is the same as that of F 2 ZMVF (x, Q 2 ) as Q 2 ->■ oo. This type of scheme 
is used in [23] using 0(acs) coefficient function expressions along with LO PDFs, and 0(a s ) 
coefficient function expressions along with NLO PDFs. However, ultimately the FFNS scheme 
is used for structure functions in fitting to structure function data, 0(a s ) for both NLO and 
NNLO fits (approximate 0(a s ) coefficient functions, and a running quark mass, are used in 
[23] and [25]). The FONLL approach, which modifies F 2 asymp (x, Q 2 ) and F ZMVF (x,Q 2 ) by, for 
example, the replacement of x by x m order to maintain correct kinematics in each term, is used 
at NLO in [26] and at LO and NNLO in [27]. By default the order of as is the same in each of the 
three terms and the GM-VFNS uses 0(a s ) coefficient function expressions along with LO PDFs, 
0{ots) coefficient function expressions along with NLO PDFs and 0(a s ) coefficient function 
expressions along with NNLO PDFs. There is a version of FONLL , FONLL-B, which uses one 
power higher in the FFNS and subtraction term, but where the subtraction term contains only 
the \n(Q 2 /m 2 h ) terms, not the finite part, so that at Q 2 = m 2 h , F GMVF (x,Q 2 ) = F^ F (x,Q 2 ). 
This reproduces the correct NLO FFNS at Q 2 near to threshold, but unlike the case where the 
coefficient functions are 0(as) in all pieces it leads to part (but not all) of the higher order 
NNLO contribution persisting as Q 2 — > oo, i.e. F FF (x, Q 2 ) and F 2 asymp (a;, Q 2 ) do not cancel in 
this limit so the exact F 2 ZMVF (a;, Q 2 ) is accompanied by a formally subleading term, reminiscent 
(but different in origin and form) of the frozen contribution in the TR' approach. 

Ideally one would like any GM-VFNS to reduce to exactly the correct order FFNS at low Q 2 
and exactly the correct order (one power of as lower) ZM-VFNS as Q 2 — >• oo. At present, as 
outlined above, none do. However, in the case of the TR' scheme this can easily be rectified. The 
fact that the higher-a^ order term is frozen for Q 2 > m 2 h was strictly necessary in the original 
TR scheme in order to impose exact continuity of dF^/d In Q 2 . However, this is no longer a 
requirement, so rather than this term being frozen it could instead be of the form 

(m 2 h /Q 2 ra n s(m 2 h )^C!J(m 2 h )®Mm 2 h ) or {ml/Q^a^Q 2 )^ C F 2 F (Q 2 )^(Q 2 ). (18) 

Any a > provides the correct asymptotic limit, though strictly from factorization one should 
have {m 2 H /Q 2 ) times \n(Q 2 /m 2 H ) terms. Fractional values of a could be thought of as mimicking 
the power times logarithmic corrections. As well as modifying the previously frozen term in this 
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Table V. The values of parameters defined in Eqs. (11811191120]) for different extreme GM-VFNS 
definitions. 



manner there is also more freedom in the definition of the GM-VFNS than previously used or 
explored. Indeed, this freedom is an inherent feature of a GM-VFNS, so its consequences should 
be investigated quantitatively. 

One can modify the heavy quark coefficient function so long as the correct Q 2 jm\ — > oo limit 
is maintained. However, since this appears in convolutions for higher order subtraction terms, 
it is desirable to avoid a complicated x dependence. A simple choice is 

c GMVF,(o )(g2/m 2 )Z) ^ (l + b(ml/Q 2 ) c )6(z-x max ), (19) 

where again variation in c really mimics (m\/Q 2 ) with logarithmic corrections. One can also 
modify the argument of the 5-function, similar to the Intermediate-Mass IM scheme [22J , we can 
define 

f = x/ Xmax -> x{\ + (x(l + 4m 2 /Q 2 )) d 4m 2 /Q 2 ), (20) 

so the kinematic limit stays the same, but if d > (< 0) small x is less (more) suppressed. The 
default a, b, c, d are all zero, but can vary, being limited by fit quality or sensible choices, e.g. we 
would not choose b < — 1, since this would give a negative zeroth-order heavy quark coefficient 
function near Q 2 = m\. There is also a potential source of uncertainty from variation of the 
transition scale away from /i 2 = m\. However, much of the type of possible variation in F£ is 
very similar to that obtained by the variation of parameters already described. 

A variety of different choices defined in Tabled] has been tried at NLO and at NNLO, along 
with the ZM-VFNS (at NLO)Q The resulting variations in F£(x, Q 2 ) near the transition point 
due to different choices of GM-VFNS at NLO are shown in the left of Fig. [TJ (A similar variation 
under changes in scales and scaling variable has appeared in [21].) I also define an "optimal" 
scheme which is chosen to be smooth at threshold, at NLO, and reduces to exactly the right 
limits at high and low Q 2 . There is quite a spread in the values of F^(x,Q 2 ) at NLO, though 
the ZM-VFNS is far steeper at low Q 2 than any GM-VFNS. The continuation of the FFNS 
result above Q 2 = m| is also shown at NLO, and it is clear that at both x values all GM-VFNS 
variations are ultimately rising above this. In the vicinity of Q 2 = m 2 the optimal scheme stays 
closest to FFNS. The spread is very much reduced at NNLO, the right of Fig. [1] with almost 

1 Some of the results from this were presented in [25] . 
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1 2 3 456789 Q 2 1 2 3 456789 Q 2 

Figure 1: The variation in F%{x, Q 2 ) generated from a variety of choices of GM-VFNS at NLO 
(left) and NNLO (right) using the MSTW2008 pdfs in each case. 

zero variation until very small x. At this point the precise cancellation between different terms 
in the expression for the GM-VFNS near the transition point is starting to fail to some extent, 
showing that NNLO evolution effects are most important in this regime and even higher orders 
in the GM-VFNS are needed for there to be very little sensitivity to choices. The FFNS is 
not shown in this case as the NNLO ((9(a|)) contribution to this is only approximate and it is 
possible, and arguably likely, that the approximation is worse away from the Q 2 ~ m\ regime 
where it is important in the GM-VFNS. 

Global fits are also performed using the same procedure as the MSTW2008 fit [2H] for all 
schemes, and the value of as(M|) is allowed to vary in all cases. At NLO the initial x 2 f° r a new 
GM-VFNS can change by up to 250, but converges to within 20 of the original with refitting. 
The fit is improved for options 1, 2 and 6, and the optimal scheme and the fit is best for the 
option 6, where it is 23 units lower than for the best fit in |29j - in particular the quality of fit to 
the HERA F% data improves from 110 to 90 for 83 points. For option 2, the fit quality is almost 
unchanged. For options 4 and 5 the values of b and d respectively are limited by the quality 
of fit of the HERA F 2 C data, i.e. we stop variations when the fit quality reaches 7 units higher 
than the best fit for MSTW2008. The variations in the partons extracted at NLO are shown 
in the left of Fig. |2j The default TR' scheme sits near the low end at small x values. Some 
changes in PDFs exceed the one a experimental PDF uncertainty in the upwards direction at 
small x. as(Mz) changes by < 0.0007 for all variations of the GM-VFNS, only about half the 
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Figure 2: The variation of the gluon (upper plots) and quark (lower plots) PDFs at Q 2 = 
10, OOOGeV 2 obtained from the best fit from a variety of choices of GM-VFNS, the ZM-VFNS 
and the FFNS at NLO (left) and the GM-VFNS variations at NNLO (right) as a ratio to the 
MSTW2008 PDFs. The one-sigma uncertainties for the MSTW2008 PDFs are shown as dash- 
dotted lines. 



PDF set 


Tevatron 




LHC 


(7 TeV) 


LHC 


(14 TeV) 




a z (nb) 


CTf/(pb) 


a z (nb) 


er#(pb) 


a z (nb) 


0\ff(pb) 


MSTW2008 


7.207 


0.7462 


27.70 


12.41 


59.25 


40.69 


GMvarl 


+0.3% 


-0.5% 


+0.7% 


-0.1% 


+ 1.1% 


+0.2% 


GMvar2 


+0.7% 


-1.1% 


+2.0% 


+0.5% 


+3.0% 


+ 1.5% 


GMvar3 


+0.1% 


-0.3% 


+0.7% 


+0.4% 


+1.1% 


+0.8% 


GMvar4 


+0.0% 


-0.1% 


-0.3% 


-0.1% 


-0.4% 


-0.2% 


GMvar5 


-0.1% 


-0.1% 


-0.4% 


-0.2% 


-0.5% 


-0.3% 


GMvar6 


+0.3% 


-0.4% 


+ 1.0% 


+0.3% 


+1.6% 


+0.8% 


GMvaropt 


+0.3% 


-1.5% 


+1.5% 


+0.1% 


+2.0% 


+0.4% 


ZM-VFNS 


-0.7% 


-1.2% 


-1.6% 


-1.8% 


-3.0% 


-3.1% 


FFNS - DIS only 


+3.1% 


-14.4% 


+4.5% 


-0.7% 


+7.0% 


+3.0% 


GMvarcc 


+0.0% 


-0.1% 


+0.0% 


-0.1% 


+0.0% 


-0.1% 



Table 2: Predicted cross-sections at NLO for Z and a 120 GeV Higgs boson at the Tevatron and 
LHC. 
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experimental uncertainty found in [30] G I also perform a fit using the ZM-VFNS. In this case 
the value of a s {M%) falls by 0.0015. The ZM-VFNS PDF is clearly outside the GM-VFNS band 
in some places. The fit quality is about 200 units worse than the MSTW2008 fit. 130 of this is 
the fit to the F£(x, Q 2 ) data which is very poorly fit at low Q 2 and much of the rest is from 
a worse fit to HERA structure function data. There is little difference in the fit if the charm 
structure function data are omitted. 

It is also interesting to investigate a fit in the FFNS scheme. In this case it is not possible 
to perform a full global fit since the coefficient functions are not fully known for hadron-hadron 
collider processes even at NLO. It would be possible to perform a hybrid fit where for structure 
functions we stayed with the three flavour description but for collider data let the same partons 
evolve in a variable flavour scheme. However, this seems a peculiar compromise - either one 
attempts to resum the logarithms or one does not. Hence, I consider a FFNS fit to structure 
function data only, also omitting the high-Q 2 charged current HERA data, which also rely on not 
fully known coefficient functions at order a 2 s . The resulting fit has x 2 — 1944/2089, compared to 
x 2 = 1878/2089 for the MSTW2008 fit, i.e. the contribution of the the y 2 from these data to the 
full MSTW2008 fit, not a new fit to the restricted data set. A new fit using the default GM-VFNS 
gives an improvement of about 30 units and only minor changes in PDFs. The improvement is 
mainly due to some reshuffling of the flavours with no Drell Yan data constraint allowing a better 
fit to the fixed target DIS data. There is little change for HERA data. The fit using the FFNS 
has a value of as(M§) of 0.1187 when it is evolved from low scales in a variable flavour scheme, 
0.0025 lower than the MSTW value. This lower value allows a better fit to BCDMS data, but 
there is a deterioration in the fit to NMC data, to the nuclear target neutrino DIS data and the 
the HERA data at higher Q 2 where F 2 (x,Q 2 ) does not rise as quickly with Q 2 as in the GM- 
VFNS. The fit to F|(a;,Q 2 ), for which the most precise data is at relatively low Q 2 , is slightly 
better in the FFNS, but this is far from compensating for the deteriorations elsewhere. The 
partons obtained in the FFNS fit are qualitatively different to those in the GM-VFNS fit. The 
gluon distribution is larger at small- x and correspondingly lower at high-x. This is qualitatively 
very similar to the difference between the gluon distribution in the ABKM PDF set [23] (or 
more recently [25]), which uses the FFNS and the MSTW set, as is the change in the coupling 
constant. At small-x this difference in the gluon shape is reflected in the quarks at higher Q 2 
since their evolution is determined mainly by the gluon. The input PDFs in the FFNS fit can be 
evolved using a variable flavour scheme in order to make a more direct comparison at high-Q 2 
possible, and to examine the compatibility with collider data. The ratio to the MSTW2008 
PDFs can be seen in the left of Fig. [2j and it is clear they are qualitatively different. The 
predictions for Tevatron jet data are very poor, and there is a distinct deterioration in the fit 
quality for the Z rapidity data from CDF. In contrast the DIS-only fit using the GM-VFNS 
gives predictions for Tevatron jet data that have y 2 values only a few units higher than when 
these data are included, and the prediction for the Z rapidity data is actually marginally better 
than the global fit. Clearly at NLO the results from the FFNS and any GM-VFNS are rather 

2 A comparison between GM-VFNS definitions of different groups using a common set of PDFs can be found 
in |31j . and more complete comparisons are in progress. The differences are well- understood. At low Q 2 some of 
the differences are bigger than those exhibited in this article, but this is due to the different ordering chosen by 
different groups in the Q 2 < region which is a fixed part of our definition. 
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different. One would expect this difference to diminish at higher orders, but a precise comparison 
awaits the full calculation of FFNS coefficient functions. 

The variation in GM-VFNS is also applied to the charged- current cross-section expressions. 
None of the variations in a, b, c, d of the form already considered lead to changes in the x 2 °f 
much greater than one unit even before refitting. The change in the PDFs with refitting is largest 
for the strange quark, which is probed by dimuon production which relies on the heavy flavor 
formalism, but is always very much smaller than 1%. This is due to the fact that the changes in 
structure function with change in scheme definition are by far most significant at small x, and the 
charged current data is all for x > 0.01. It is also because in the charged- current case one only 
produces a single heavy particle in the final state, i.e. the threshold W 2 = m\ rather than 4m| 
so there is less sensitivity to how the massless limit is approached than the neutral current case. 
Finally, the data on dimuon production is a combination of the structure functions F 2 (x,Q 2 ), 
Fl{x, Q 2 ) and F 3 (x, Q 2 ), each with a particular y dependence. In practice the contribution from 
the charm quark contribution and the corresponding gluon subtraction term are suppressed by 
a factor of (1 — y) 2 - For these data 0.3 < y < 0.7 so this factor is always significantly less than 1, 
and moreover the contribution with larger charm quark or subtraction are at larger y and most 
suppressed. H. 

The predictions for cross-sections are shown at NLO in Table. [2j It is assumed that any 
^( m cb/ m l_ff) corrections which would occur in a full GM-VFNS definition are negligible and 
the ZM-VFNS result is used. There is at most a 1.5% variation at the Tevatron. At the recent 
energy at the LHC of 7 TeV there is a variation of +2% to —0.4% in oz- This is increased 
to +3% to —0.5% at 14 TeV due to smaller x being sampled. The spread in a H at the LHC 
is about halved compared to oz due to the higher average x sampled. The ZM-VFNS is a 
clear outlier in the low direction at the LHC. However, the ZM-VFNS variations are only about 
3% at most compared to our default, and never more than 5% even compared to the most 
upward GM-VFNS variations. Hence, the ZM-VFNS change is more similar to that observed 
by NNPDF when adopting a GM-VFNS to obtain PDFs [26J, than the larger changes observed 
by CTEQ when adopting a GM-VFNS as default for the first time [32]. The FFNS predictions 
are also shown, and display some rather large variations, particularly in the Tevatron Higgs 
cross section. However, it should be remembered that this is a fit to a more limited data set 
than all the others. It is, however, indicative of the major differences seen between MSTW2008 
PDFs and some PDFs obtained using fits using the FFNS. GMvarcc denotes variation in the 
GM-VFNS for charged current processes, and clearly the effect is very small indeed. 

For fits at NNLO the initial changes in \ 2 are much smaller than at NLO, i.e. < 20, and 
they converge to within 10 of the original. In fact they are all within 2-3 units of the default 
fit except for options 2 and 6 where the fit quality is about 10 worse. In fact for option 6 the 
variation of the d parameter is limited by the deterioration in the fit to the HERA F£ data 
in the NNLO fit rather than NLO. The variations in the PDFs extracted at NNLO are shown 
in the right of Fig. [2j At the very worst the changes approach the one a experimental PDF 

3 There is the related issue of how diagrams where the final state charm quark is generated away from the 
parton-boson interaction vertex should be considered in dimuon production, which is done differently by different 
groups and related to the acceptance corrections. This should be clarified, but is a separate topic to the particular 
scheme variations considered in this article. The suppression noted above makes the effect very small, however. 
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Figure 3: The variation in PDFs at Q 2 = 10, OOOGeV 2 obtained from two choices of GM-VFNS 
at NNLO where the coefficient of the unknown constant in the small- a; term in the 0{a\) part of 
the FFNS coefficient function is taken to its extreme value. In one case it is frozen for Q 2 > m 2 
and in the other it falls like m 2 /Q 2 . 

uncertainty, but are usually far less. Variations in ots(M|) are ~ 0.0003. Hence, the stability of 
the PDFs to variations in the parameters a, b, c, d defining how quickly the heavy flavour PDF 
turns on, and correspondingly the size of subtraction terms, diminishes greatly at NNLO due to 
increased cancellation of the two types of term. 

However, at NNLO there is another source of uncertainty associated with the GM-VFNS 
definition. The TR' scheme models the C?(a|) FFNS terms at low Q 2 using known leading 
threshold logarithms [33] and ln(l/x) terms [3lj. The small-x model takes the form 

A(Q 2 /m 2 h )(l - z/ XmBX ) & Qn(l/z) - b)/z, (21) 

where A(Q 2 /rnfy is a calculated function, but a and b are free parameters determining at what x 
the leading ln(l/x) contribution becomes dominant. The default values are a = 20, b = 4, but a 
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Table 3: Predicted cross-sections at NNLO for Z and a 120 GeV Higgs boson at the Tevatron 
and LHC. 

and b can be varied. Changes in a within reasonable limits make little difference. The maximum 
sensible variation of b = 2 (determined by the fact that the fit quality is not significantly altered, 
similar to the variations of a-d) leads to an effect of order the experimental PDF uncertainty at 
x < 0.001, as shown in Fig. [3j However, this change in the PDFs is largely eliminated if the 
contribution dies away like m|/Q 2 , rather than being frozen, as also seen in Fig. |3j This 
is another reason for preferring this choice. 

The changes in predictions for cross sections at NNLO are seen in Table. [31 Other than 
model dependence - GMvarmod denotes the variation to b = 2 in the 0{a\) term - which 
reaches slightly more than 1% for 14 TeV at the LHC, the maximum variations are of order 0.5% 
at LHC. GMvarmod' is when the O{og) terms fall with Q 2 , and clearly exhibits a very small 
deviation. Hence, as the changes in PDFs essentially guaranteed, there is far less uncertainty 
in predictions for cross sections due to uncertainties in definitions of the GM-VFNS at NNLO 
than at NLO, exactly as we would expect for a theoretical ambiguity due only to the finite order 
at which one is working. The reduction in uncertainty is exactly as (usually) observed for scale 
variations. 

As a point of detail I note that at even at NNLO the smoothness across the transition point 
exhibited in the right of Fig. [1] is not ideal. This is not so surprising at x = 10~ 4 since as 
noted a new ln(l/x) divergence in the evolution of the heavy quark is introduced into the quark 
evolution at NNLO which is not effectively cancelled by the subtraction terms. However, at 
x = 0.001 the slope immediately above the transition point is slightly flatter than below it. This 
is the same for all variations of the GM-VFNS, so in fact is not a feature of the definition of a 
GM-VFNS at all. It can be traced to the behaviour below Q 2 = m|, i.e. to the modelling of the 
0(a|) coefficient function. In the small-x part of this in Eq. ( 121]) it is assumes that the Q 2 /m| 
dependence of the constant term b is the same as the known ln(l/x) term. Modifying this by 
a rather slowly varying factor of (Q 2 /ml) ' 1 results in the rather smoother behaviour shown in 
Fig. HJ Hence, this might be thought of as an improvement to this C(a|) coefficient function. 
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Figure 4: The variation in F£(x, Q 2 ) generated from a variety of choices of GM-VFNS at NNLO 
where the low Q 2 modelling has been modified in one, improving the smoothness at the transition 
point (left-hand plots). In the right-hand plots only this choice is shown for clarity. 

However, it affects the structure functions only below Q 2 = m 2 for the charm contribution, i.e. 
below the MSTW2008 Q 2 cut, and the influence from the change in the very small amount of 
F% for Q 2 < ml is very minor. Even without refitting the global fit changes by only about 1 
unit, and so no significant change in PDFs occurs. More important, but very similar in practice, 
is an improvement in threshold corrections which could be made in the same term. Since the 
publication of the MSTW2008 PDFs more logarithms in the threshold correction have been 
calculated, see [35J. These can be included in the 0(a\) coefficient function (though some 
approximation is needed since in (35] the value of one coefficient is presented only for specific 
m 2 /Q 2 values, though the full result very recently appeared in [36]). The result is that this 
improvement itself makes the behaviour at the transition point smoother, and leads to a small 
modification of the structure function both below, and at the transition point. The change at 
the transition point then persists as a constant if this term is frozen, though becomes relatively 
less important very rapidly, or quickly disappears if this term is defined to fall away like m\jQ 2 . 
Hence, even though this improvement will be included in future schemes, and is beneficial to the 
detailed behaviour of the structure function, within this definition of an NNLO GM-VFNS the 
effect on the PDFs is extremely small. 

Finally, it is an interesting question whether the changes in the GM-VFNS have any influence 
on the value of the quark masses preferred by the MSTW2008 fits, as presented in [12] . As noted 
in this previous study there is little constraint on m& from fits, and indeed there is very little 
sensitivity to the GM-VFNS definition. However, there is a distinct effect on the conclusions 
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regarding m c (see also [37J). The default MSTW2008 choice of charm mass m c = 1.4GeV is 
chosen to be near the best fit value at NLO, which turns out to be m c = 1.45GeV with a 
very marginally better fit quality than m c = 1.4GeV. The same default mass is used in the 
studies in this article to enable direct comparison to MSTW2008 results. However, when m c 
is allowed to vary using the optimal scheme at NLO the best fit value is for m c = 1.32GeV, 
and the quality of the fit is 11 units better than for m c = 1.4GeV and 17 units better than 
the best fit for m c = 1.45GeV using the default scheme. The fit does prefer the smoother 
behaviour near the transition point in the optimal scheme, but the slower turn on of F£ at low 
Q 2 results in a preference for a lower mass. The variation with scheme is much less at NNLO, and 
correspondingly the variation in the best fit mass is also smaller. It changes from m c = 1.26GeV 
in [12] to m c = 1.23GeV using the optimal scheme. The best fit quality for the preferred mass 
is 7 units lower using the optimal scheme at NNLO. Hence, the difference in the values for the 
pole charm mass extracted at NLO and NNLO has reduced from 0.19GeV in [12], to 0.09GeV 
using the optimum scheme, and both NLO and NNLO fits are within their uncertainty for m c 
for an intermediate value of m c = 1.28GeV. Compatibility is certainly improved, but the values 
are rather low, a trend also seen for the charm mass defined in MS scheme extracted from fits 
to structure function data in [21]. There is some evidence in this study that the convergence 
of the perturbation series in the FFNS is quicker when using the MS scheme definition of the 
mass than when using the pole mass as is used in this study and is currently the most common 
framework. It would certainly be interesting to investigate this in the context of GM-VFNS 
analyses. However, the situation is not quite the same in a GM-VFNS. The first dependence 
on scheme occurs for coefficient functions at order a|. Hence, for many GM-VFNS definitions, 
e.g. any ACOT variation and the default NNPDF choice, the mass-scheme dependence would 
not set in until NNLO. For our definition the as coefficient functions only play a minor role 
at low Q 2 , being either frozen at Q 2 = m| or falling away quickly in the new optimal scheme. 
Hence, the mass-scheme dependence will be less influential at NLO in the GM-VFNS than in 
the FFNS, particularly in the optimal scheme. At NNLO the a 2 s coefficient functions are used at 
all scales, so the mass-scheme dependence will come into play fully. However, at low Q 2 where 
the mass-scheme dependence is undoubtedly most important there is the additional uncertainty 
due to the lack of the full 0(a s ) coefficient functions, so it is uncertain how much information 
the mass-scheme dependence can currently provide. 

To summarise, I have considered variations in the definition of a GM-VFNS within the 
framework used in MSTW, and various other global fits for PDFs. In particular I have noted 
that none (in any framework) currently reduce to exactly the appropriate order FFNS expression 
at low Q 2 and also the appropriate order ZM-VFNS expression as Q 2 /m 2 — > oo, though I have 
shown this can be easily achieved in the TR' scheme. I have investigated the variation of both the 
heavy flavour structure function obtained and the PDFs extracted when 4 parameters controlling 
the definition of the GM-VFNS are varied within limits set by fit quality or sensible behaviour at 
NLO and NNLO. At NLO, variations in the resulting PDFs can be similar to the experimental 
PDF uncertainties at small x, and this is interpreted as a theoretical uncertainty. As such it is 
probably most appropriate to add this to the experimental PDF uncertainty linearly. However, 
as with other theoretical uncertainties it is impossible to gauge a confidence level. In principle a 
smooth GM-VFNS choice which also gives the best fit should provide the central value. I would 
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then consider the full variation of GM-VFNS definitions in this article as somewhat conservative, 
and very likely to be more than a one sigma uncertainty. At NNLO there is much greater stability, 
showing the success of the GM-VFNS approach, and the theoretical uncertainty is much smaller 
than the experimental PDF uncertainty. One of these GM-VFNS choices is extremely smooth 
near the transition point, a particular improvement at NLO, and reduces to exactly the correct 
asymptotic limits. It also gives a better fit quality, particularly at NLO, and less variation in 
the preferred m c value between NLO and NNLO. This so-called "optimal" scheme seems to 
be a preferred option for global fits in future than the current default scheme used in the TR' 
definition, which was chosen originally simply for simplicity and continuity, and I recommend 
this new alternative for future global PDF fits. 
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